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Abstract 

We prove a new upper bound for diagonal two-colour Ramsey num- 
bers, showing that there exists a constant C such that 

r(k + l,k + l) < k °io g iogf=| 



1. Introduction 

The Ramsey number r{k, I) is the smallest natural number n such that, 
in any red and blue colouring of the edges of the complete graph on n vertices, 
we are guaranteed to find either a red or a blue K[. 

That these numbers exist is a consequence of Ramsey's original theorem 
[R30], but the standard upper bound 

'k + r 



r(k + 1,1 + 1) < 



k 



is due to Erdos and Szekeres [ES35]. 

Very little progress was made on improving this bound until the mid- 
eighties, when a number of successive improvements were given, showing that, 
as expected, r{k + 1,/ + 1) = o(( k ^ 1 )). Firstly, Rodl showed that for some 
constants c and c' we have 

c( k+l ) 

r(fc + U + l)< c ) kJ 

log c (k + 1) 

This result was never published, but a weaker bound, 

6 rk+i\ 

r(k + 1,1 + 1) < 1 '' ' 



log log(fc + 1) 

appears in the survey paper concerning Ramsey bounds by Graham and Rodl 
[GR87]. 
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Not long after these bounds were proven, Thomason [T88] proved that 
there was a positive constant A such that, for k > I, 

f k + r 



r{k + U + 1) < exp j-i^-logfc + Av^Qgfcj 



k 



this being a major improvement on Rodl's bound when k and I are of approx- 
imately the same order, implying in particular that 

r (jfe + 1, k + 1) < k -l/2+A/^S£k^ 
In this paper we will show how to improve on Thomason's result, obtaining 



Theorem 1.1. There exists a constant C such that 

k 



r(k+l,k + l)<k c i°J°gk 



In particular we have the following natural extension of Thomason's The- 
orem: 

Theorem 1.2. For all s > there exists a constant C s such that 
r(Jfe + l,fc + l)<^( 2 *' 

2. An Outline of the Proof 

Our argument (and also Thomason's) begins by assuming that we are 
trying to prove a bound of the form r(k + 1,1 + 1) < f(k,l){ k ^ t ), where 
f(k,l) is some slowly changing function in k and I. In order to construct 
an inductive argument we will assume that for some such function we have 
r(a + 1,6+1) < f(a, b) ( a + 6 ) whenever a is less than k or b is less than I, and 
that we would like to show that the same holds for a = k and b = I. 

To this end, let us suppose that n = [f(k,l)( k ^ l )\ = f*(k, I) , say. 
Then by the argument that proves the Erdos-Szekeres inequality 

r(k + 1,1 + 1) <r(k,l + l) + r(k + l,l), 

we see that, within any red/blue colouring of the edges of K n which does not 
contain a red K^+i or a blue K\ + \, every vertex x can have red degree at most 
r(k, I + 1) — 1 and blue degree at most r(k + 1,1) — 1. Therefore, if d x is the 
red degree of the vertex x (so that n — 1 — d x is the blue degree) , 

d x <r(k,l + 1) 

'k + l-l s 
k-1 
/(A; -1,0 k 



<f(k-l,l) 



f*(k,l) k + l 



n. 
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Similarly, we may use the fact that n — 1 — d x < r(k + 1, 1) — 1 to show that 



, . , , f(k,l-l) I , 



Now, note that if / were always one, then we would know that d x was less 
than j^pjn for each vertex x and also that it was greater than or equal to jr^n, 
a contradiction which is equivalent to the Erdos-Szekeres argument. 

If, instead, we allow the size of f(k,l) to change with both k and I, albeit 
slowly, then we find that for each vertex x the red degree d x is not much greater 
than -j^jn nor much less than it. So we find that the graph is approximately 
regular in degree, the proximity to true regularity being dependent upon how 
slowly f(k,l) changes. 

This approximate degree-regularity is not however the only structural in- 
formation that we have about graph colourings which contain neither a red 
-Kfc+i nor a blue Ki + \. We also know, for example, that in such a graph any 
red edge can lie in at most r(k — 1,1 + 1) — 1 red triangles, and if the vertices 
of this red edge are x and y then there are at most r(k, I) — 1 vertices which 
are connected to x by a red edge and y by a blue edge. If we let d xy be the 
number of vertices which are connected to both x and y by a red edge, then 
these two conditions are enough to tell us that 

2 



dxy ~ I k + l 1 "' 



the exact proximity being again dependent upon the rate at which f(k,l) 
changes. That is, providing that we don't try and improve too much on the 
Erdos-Szekeres bound, we can conclude that across any red edge we have ap- 
proximately the expected number of red triangles (that would be in a random 
graph formed by choosing red edges with probability jjzj)- As a consequence, 
we see that across any red edge there are approximately the expected number 
of red C4S of which the red edge is a diagonal. Importantly, this latter result 
is not restricted to red edges alone - it is straightforward to use the degree- 
regularity conditions and the analogous condition that we have approximately 
the expected number of blue C4S across a blue edge in order to show that we 
have approximately the expected number of red C4S across that edge as well. 

At this stage it is appropriate to recall (at least roughly) the definition of 
quasirandomness: a regular or approximately regular graph is called quasiran- 
dom if it contains approximately the expected number of C4S that would be 
in a random graph chosen with the edge probability dictated by the density 
of the graph (see for example [CGW89], [T87]). The standard results of the 
theory imply that if a graph satisfies this criterion then it also satisfies many of 
the properties that are expected with high probability of a random graph. For 
example, and this is what will be important to us, it contains approximately 
the expected number of any small graph. 
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The properties that we now know about a colouring of a K n not containing 
either a red Kk+\ or a blue -fQ+i are enough to tell us that both the red and 
blue components of our colouring are quasirandom, so we see that in such a 
colouring we have approximately the expected number of any small graph in 
either colour. In particular, for any fixed r, we have approximately 

k \G) 



k + l 



ordered red r-tuples (we find it more convenient to count r-tuples rather than 
K r s since we then don't have to worry about multiple counting in our estimates, 
but it might perhaps be best to think of it in terms of counting K r s). 

If this were in fact precise then it would be inconsistent with the fact that 
any red (r — l)-tuple lies in at most r(k — r + 2, 1 + 1) — 1 red r-tuples, since 
this gives an upper bound on the number of red r-tuples of 

k \(^) 



r{k-r + 2,l + l)[ 1 -— l ) n r ~ ] 



which, since 



„ , 1N f(k-r + l,l) k---(k-r + 2) 

is strictly less than the expected number if the rate of change of / is sufficiently 
small. 

It is precisely this contradiction which allows us to prove our result. There 
are of course several technical caveats, the most interesting of which is that, 
in order to derive Theorem 1.1, it is not sufficient to know that the graph 
is simply quasirandom. It is necessary to apply instead our local condition 
that we have approximately the expected number of red C4S across any given 
edge. Theorem 1.2, on the other hand, is derivable from the quasirandomness 
condition alone (though we will not do this here). 

Secondly, the argument as stated above is slightly illusory - in order to 
derive a useful result it is necessary to take into account the fact that a change 
in the number of K r -\s will be reflected by a change in the number of K r s. 
Without doing this, we would be able to do no better than Thomason's result. 

Where, incidentally, do we depart from Thomason's work? His proof is 
essentially the argument given above in the case r = 3. He counts, in two 
different ways, the number of monochromatic triangles within a graph not 
containing a red K^+i or a blue Ki + \, showing that, unless a bound of the 
form 

r(k + 1,1 + 1) < exp j-^logfc + ^Vlogfej ( k * ^ 

held, there would be a contradiction. While his method of finding an upper 
bound for the number of monochromatic triangles is similar to ours above (the 
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number of red triangles across a given red edge is at most r(k — 1, Z + 1) — 1, 
and we know, approximately, the number of red edges), his method for finding 
a lower bound is to apply Goodman's formula 



where by d x we mean the red degree of the vertex x. This formula is only 
dependent upon the degree sequence, and so, knowing that every degree is ap- 
proximately what's expected, we can show that the number of monochromatic 
triangles is approximately what's expected. Our main advance then is to have 
shown how we can use the quasirandomness conditions to circumvent the fact 
that there is no Goodman-type formula for r > 4. 

This discussion raises one further question: why, if Thomason's result 
implies an off-diagonal estimate as well as a diagonal one, and our arguments 
are the natural extension of Thomason's argument, do we not also have off- 
diagonal theorems which include Theorem 1.1 and Theorem 1.2 as special 
cases? The first part of the answer is that for Theorem 1.2 we do, the following 
theorem being our main result in this case: 

Theorem 2.1. Let s and e be fixed positive constants with e < 1. Then 
there exists a constant C Sjt such that for k > I > ek 



However, if we now fix e and let s increase, the best theorem that we can 
deduce from our knowledge of the growth rate of the C Sj£ is 

Theorem 2.2. For all e < 1 there exists a constant C e such that for all 
k and I with k > I > ek, 



Interestingly, as with Theorem 1.2, we only need the ordinary quasiran- 
domness condition and not its local counterpart to derive these results. It is 
only when we make e tend towards 1 as s increases that our local conditions 
become genuinely useful. So the reason why we don't have an off-diagonal 
version of Theorem 1.1 now emerges: our method doesn't allow one. 

We will not prove Theorems 2.1 and 2.2 in this paper, concentrating in- 
stead on the diagonal results. It should however be clear to the reader how we 
can go about changing our main results in order to derive them. 

We begin the proof proper in the next section by considering, more for- 
mally, the various regularity conditions that a graph containing neither a red 
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Kt+i nor a blue Ki + \ must satisfy, and showing what these conditions imply- 
about such a graph. 



3. The Regularity Conditions 

The following notation will prove essential to us in what follows: 

Definition. Suppose we have a red/blue colouring of the edges of the 
complete graph on n vertices, and let V be the set of vertices. Then we define 
the balanced function of the colouring around probability p as the function 
g : V x V -> R with 

g(x,y) = A(x,y) - p, 

where A : V x V — > R is the characteristic function of red edges, that is A(x, y) 
is 1 if there is a red edge between x and y and otherwise. 

Note that normally one chooses the probability p in such a way as to 
make g(x,y) = 0, but for the sake of simplicity in our exposition, we 
will be centring around a probability which is not quite the correct balanced 
probability, but which is very close. 

We will also need to introduce two constants, 7 and 8, which bound the 
growth (or rather fall) of f(k, I) with respect to k and I respectively. Our main 
result in the next section will be an inequality telling us what kind of rate 
of change of f(k,l) is admissible. More specifically, we will assume that we 
have two real numbers 7 and 5 and a natural number n = [f(k,l)( k ^ l )\ = 
f*(k, 0( fe jtO' suc ^ * na * f° r m = 1) 2 and r — 1, each of the inequalities 

r(k + 1 - m, I + 1) < f(k - m, I) \ ~ "' ' ' 



r(k + 1, 1 + 1 - m) < f(k, I - m) 



k — m 
k + 1 — m 
k 



f(k — m,l) _ , , f(k,l — m) „ , 

holds. What we will show (by the counting K r s argument we discussed in the 
last section) is that if k > I, where k and I are sufficiently large numbers of 
approximately the same magnitude, and if 

kl + l6<—-, 

then 

r(k + l,l + l)<f(k,l)( k + l 

The conditions on 7 and 5 essentially amount to 7 and 5 being the partial 
derivatives of 4>(k,l) = — log f(k, I) with respect to k and I respectively. Thus, 
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if we consider the inequality kj + 15 < ^-^j: as a partial differential equation 
(by putting 7 = and 5 = it is easy to see that taking f(k,l) = 

exp{ — log/c} for k > I works as a potential solution. Indeed a more 
careful treatment of this argument, taking into account the fact that 7 and 5 
do not quite equal the respective derivatives, is what will allow us to derive 
our results. 

The specifics of this must, however, wait until later sections. The task at 
hand is show what we can say about large graphs not containing either red 
Kk + is or blue K[ + is. We begin by writing our various regularity conditions as 
constraints on the size of certain products of the balanced function: 

Lemma 3.1. Let k and I be natural numbers, let 7 and 5 be real numbers 
and let n = [f(k,l){ k f)\ = f*(k,l)( k + l ). Suppose that for m = 1 and m = 2 
each of the inequalities 



r(k + 1 - m, I + 1) < f(k - m, I) 



r(k + 1, 1 + 1 - m) < f(k, I - m) 



k — m + 1 
k — m 

k + 1 — m 
k 



f(k-m,l) f(k,l-m) 5 

f*(k,l) ^ l + m ^ ana Si + mo 

holds. 

Then, in any red/blue colouring of K n not containing either a red K^+i 
or a blue K\+\, the balanced function g(x,y) of the colouring around p = 
satisfies 

IS v-^ &7 
n < > g[x,y) < -^—rn 



k + l ~ ia ' ~ k + l 

y 

for all x, and 

J2 9(x, y)g(y, z) < (hy + lS)n + 1 

y [k + iy 

for all x and z with x/z. 

Proof. The first part of the lemma follows from our observation in Section 
2 that for any vertex x in our colouring we have 

d W-i) l \ n<d < /(fe-i,0 k 

V 1 f*(k,i) k+i) n - dx - f*(k,i) k+i n - 

Noting that d x = A(x, y), A(x, y) = p + g(x, y), and applying our assump- 
tions on the growth rate of / gives the required result. To prove the upper 
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bound, for example, note that 
k 



k + 



■jn + ^2g(x,y) = d x 



f(k-jj) k 

\ 71 

~ f*(k,l) k + l 

^ 1 + ^kTi n - 

Subtracting ^nfi from either side then gives the required bound. 

For the second part of the lemma note that no red edge (x, z) can lie in 
more than r{k — 1, 1 + 1) — 1 red triangles. This implies that 

Ax, y)Av, z ) < r(fc - i,z + 1) - 1. 

y 

If we split up the left-hand side we then get, by using the conditions of the 
theorem, that 

p 2 n + p^2g(x,y) + p^g{y,z) + ^g{x,y)g{y, z) < p 2 {l + 2j)n, 
y y y 

and hence by the first part of the lemma 

k 

^2g(x,y)g(y,z) < 2 — - ^ (ley + lS)n. 

The result follows similarly for blue edges, although we need to be a little bit 
careful, since we get two extra degenerate "triangles" (those for which y = x 
or y = z). □ 

In counting the number of red K r s in a given colouring, we will use the 
following notation: 

Notation. Fix a red/blue colouring on K n and let g(x, y) be the balanced 
function of the colouring around probability p. Suppose also that K r is the 
complete graph on the r vertices vi,V2,--- ,v r , with r < n. Then, for every 
subgraph H of K r ,we write 

9H = II 9{xi,Xj), 

Xi,— ,x r (v i: Vj)eE(H) 

where the sum is taken over all r-tuples of vertices in K n (including degenerate 
terms where two or more of the X{ are the same). 

By rights this is a function of n and r as well as H, but we will be 
almost universally consistent about counting K r s within K n s, so these labels 
are essentially redundant. 
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Given this notation, the number of red K r s (or rather red r-tuples) in a 
colouring of K n is given by 

E II A(xi,Xj)= E Yl iP + 9(xi,Xj)) 

Xl,- ,%r (Vi,Vj)eE(K r ) Xi,- ,X r (v z ,Vj )&E(K r ) 

= E p {lhe{H) SH, 

HcK r 

where, again, the sum is taken over all r-tuples of vertices in K n . So in order 
to estimate the number of K r s we will need to be able to estimate gu for 
each and every subgraph H of K r . Almost all of the estimates we will need 
are encapsulated in the next lemma, which shows how we may use our local 
quasirandomness condition to obtain estimates on products of the balanced 
function. 

Utilising the information provided by the previous lemma, we shall now 
assume that we have g(x, y)g(y, z) < vn for all x and z with x/z, where 
v is some positive constant. The next lemma tells us that if H has a vertex 
of degree d then (to the highest order in n) \gn\ < \f2v d l 2 n r . Within the 
statement of the lemma, we will make the simple assumption that v < 1. This 
is not strictly necessary but tidies up the form of the lemma, and as we shall 
see later is trivially satisfied for k and I large. 

Lemma 3.2. Suppose that the balanced function g(x,y) of a red/blue colour- 
ing of a graph on n vertices satisfies 

J2g(x,y)g(y,z) < vn 

y 

for all x and z with x / z, and some fixed positive real v. Then, provided that 
v<\, 

\J2 E 9(V,xi)---9{y,Xd)h{x 1 ,--- ,x c+d )\ < V2v d ' 2 n c+d+1 + \j^- n« 

y xi,---,x c+d v 

for any function h of c + d vertices which is bounded above in absolute value 
byl. ' 
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Proof. For d odd, we have 

l£ £ 9(y,xi)g(y,x 2 )---g{y,x d )h(x 1 ,--- ,x c+d )\ 2 

y Xi,--- ,x c+ d 

< n c+d £ | £ 9{y, xi)g(y, x 2 ) ■ ■ ■ g(y, x d )h(xi, ■■■ , x c+d )f 
xi,— ,x c +d y 

<n c+d £ \^29{y,x 1 )g{y,x 2 )---g{y,x d )\ 2 

xi,— ,x c +d y 

'^C£g(y,x)g(x,y')) d 



= n 

y,y x 

<v d n 2c+2d+2 + n 2c+2d+ \ 

where the remainder comes from the degenerate terms. Since this is less than 
the square of 

2v d / 2 

we are done in this case. 

For d even, the proof is the same until we reach the second last line, when 
we need to estimate 

y,y> x 

To do this we split our sum into two pieces, a set P of edges (y,y') where 
^2 X g(y, x)g(x,y') is positive and a similar set N where ^2 x g(y,x)g(x,y') is 
negative. Then the proof in the odd case tells us, since a sum of squares is 
positive, that 

£ CE9(y,x)9M)) d+1 >- £ C£g(y,x)g(x,y')) d+1 
(y,y')eP * (y,y')eN x 

which implies 

£|£ 5 (y,xMx,y')| d+1 <2 £ (£g(y,x)g(x,y')) d+1 

y,y' % (y,y')£ p x 

<2v d+1 n d+3 + 2n d+2 . 

Finally, applying the power mean inequality, we get 

£(£ 9(y, x)g{x, y')) d < (£ | £ g(y, x)g(x, y')\ d+1 )^ 

y,y< x y,y> x 

< n^(2v d+1 n d+3 + 2n d+2 )^ 

<2v d n d+2 + -n d+1 , 
v 

so we are done in this case as well. □ 
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The result mentioned before the lemma now follows from taking y to be 
a vertex within H of degree d. The function h is then what remains, i.e. a 
certain product of balanced functions, and so satisfies the requirement of the 
lemma. 

Ultimately, as we shall see in the next section, we would like to show that 
as many gu terms as possible vanish to more than the first order in 7 and S. 
While the above results are sufficient to show that this is so when the graph 
H has maximum degree 3 or more, it still leaves a large collection of graphs 
of maximum degree 2 for which we have not reached this bound. The next 
lemma shows, however, that if we use the degree-regularity condition as well 
as the quasirandomness condition, then we have the required bounds except 
in the two cases where H is a K 2 or a K%. 

Lemma 3.3. Suppose that the balanced function g(x,y) of a red/blue colour- 
ing of a graph on n vertices satisfies 

\J2g(x,y)\ < fin 

y 

for all x, and 



^2g(x,y)g(y,z) < 



_ I'n 

y 

for all x and z with x 7^ z, and some fixed positive constants fi and v with 
v<\. Then, fori > 3, 

2 M J+i-2|i/2j 

9{xi,x 2 )g{x 2 ,X3) ■ ■ ■ g{xi-i,xi)\ ^ z/j, n ^ 1 

Xl,— ,Xl 



Y g(x 1 ,x 2 )g(x 2 ,x 3 )...g(x l ^,x l )\ < 2^+ 1 - 2 ^V^j-V + ±^_ n « 

and 



l-l 



2 

I Y 5(2/1,2/2)5(2/2,2/3) •• -g(yi,yi)\ < 2i/L , / 2 V + -n 1-1 . 

yi,-,yi 

Proof. For the first part we simply apply the Holder inequality: 
I Y 9(xi,x 2 )---g(xi- 1 ,xi)\^ 

Xl,— ,x t 

= Yj E^l'^) ^#2,^3,^) ••• 
x 2 ,x 4 ,--- \ Xl / \ x 3 / 

<( Y \Y^^ l/2] ) ( Y \Y^2,x 3 )g(x 3 ,x 4 )\^ 

\x 2 ,x 4 ,--- Xl / \x 2 ,x l ,--- x 3 , 

< (v+1 -n./^LV 2 j- 1 „.)^( 1 + _i_y ,/21 , 
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which implies the result. The second part follows similarly. 



□ 



4. The Fundamental Lemma 



In this section we will prove an extension of a lemma due to Thomason 
[T88] which gives an inequality telling us how quickly our function f(k, I) may 
change. The main idea of our proof is one that we have already seen. Instead 
of counting the number of monochromatic triangles as Thomason did, we will 
count the number of monochromatic K r s (or rather a certain weighted sum of 
the number of red K r s and the number of blue K r s), showing, using the fact 
that our graph must be random-like if it does not contain the required cliques, 
that this is approximately what is expected. On the other hand, we can again 
bound the number of monochromatic K r s above using the following further 
generalisation of the Erdos-Szekeres condition: in a graph not containing a red 
K)~ + i or a blue Ki + \ any red K r ^\ is contained in at most r (k — r + 2, 1 + 1) — 1 
red K r s and any blue K r _i is contained in at most r(k + l,l — r + 2) — 1 blue 
K r s. Then since the number of K r _\s can also be estimated (as approximately 
the expected number) we have an upper bound which we can balance against 
our lower bound. 

Again, as we mentioned in the outline, it will be necessary in the proof to 
take into account the fact that the number of K r s and the number of K r _is are 
not independent of one another, being composed almost entirely of like terms, 
although in different proportions. While most of these terms may be reduced 
to o(l) factors at the outset as being quite unimportant to the argument, the 
terms coming from single edges and triangles, which are the highest order, and 
hence the critical, terms, will be left unestimated until after we have balanced 
the number of red K r s against r(k — r + 2, 1 + 1) times the number of red 
K r -\S. Doing this allows us to reduce the error term coming from the single 
edges from being of the order of r 2 J2 X y 9{ x iV) to being r J2 X y d( x -> v)-> smce 
the single edge terms which occur in counting the number of K r ^\s cancel out 
most of the like terms which we get in counting the number of K r s. Without 
this care, our result would yield no improvement over the old bound. 

Before we begin, we need to present a few more remarks in order to illu- 
minate some of the assumptions of the lemma. What we will prove is that if k 
and I are sufficiently large depending on r, k > I > (l — i) k and 



k-y + IS < 



r-3 l 
2 fc' 



then (given the obvious induction hypothesis), we have 



r(fc + 1,1 + 1) < f(k,l) 
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Now, as at the start of Section 3, we see that with an inequality of this 
form, we expect f(k, I) to be roughly of the form 

exp <^ — - log A; 

or some multiple thereof. One result of this is that we expect both I7I and 
\S\ to be bounded by ^-f^^fr^- Since our eventual hope is to prove that f(k, I) 
has such a form we will in the course of our forthcoming proof, in order to 
simplify the final form of the result, make the assumptions that f(k,l) is at 
the smallest equal to exp{— log At} and that both I7I and \5\ are smaller than 
r logfc^ There is no deep mystery to our using r rather than here. It's just 
neater, and makes the lemma look slightly more digestible. 
We are now ready to begin the formalities. 

Lemma 4.1. Let r be a natural number, let 7 and 5 be real numbers and 
let n = [f{k,l)( k l l )\ = f*(k,l)( k + l ). Suppose that, for m = 1, m = 2 and 
m = 1 — 1, each of the inequalities 

r(k + l-m,l + l)< f(k ni.l) \ ~ '" + ' 



r(k + 1, 1 + 1 - m) < f(k, I - m) 



k — m 

k + 1 — m 
k 



f(k — m,l) f{k,l — m) 

nkj) - 1+m7 and nkj) - 1+m5 

holds. Suppose also that 

1. k>l> (l-i)fe, 

2. I7I and \5\ are both smaller than r^j^- and 

3. f(k,l) > exp{-r^logA}. 

Then there exists a constant c such that if k and I are both greater than r c 
and 

r — 3 I 

ki + is<—- k , 

the inequality 

r(k + l,l + l) <n< f(k,l) 



k + l 
k 



holds. 



Proof. To begin, note that, from Lemma 3.1, in a colouring avoiding red 
Kk+\s and blue ifj+is, we must have that the balanced function g(x, y) satisfies 

IS ^7 

■- -n < > q[x,y) < -n 

k + l ~ ,yj ~ k + l 
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for all x, and therefore, using assumption 2 of the lemma, we have that 

Elog k 
g(x,y)\ < r—^n. 

y 

Also from Lemma 1, note that, since kj + IS < 2 -J 3 -, we have that 

J2g(x,y)g(y,z) < j^~[ n 

y 

for all x and z with x ^ z (we may subsume the 0(1) term into the n term 
for k and I larger than some fixed constant - it is in performing this kind of 
estimate that we will use assumption 3 of the lemma). 

For later brevity we will use the notation that J2 X y9( x >y) = fc+7 n2 > an< ^ 
we will also write Ylxy z g(x,y)g(y, z)g(z,x) = ^j^ 3 - Moreover, we will de- 
note the quantity ^| by v so that 

J2g(x,y)g(y,z) < vn, 

y 

noting that, for k + 1 > r, we have v < 1. 

Recall that the number of r-tuples spanning a red clique is given by 

HcK r 

Our first aim will be to show that the contribution of all terms in this sum 
other than the main term (corresponding to the null set), the edge terms and 
the triangle terms can be made smaller in absolute value than -7^^ — for any 
fixed d, by taking k and / to be larger than r cr for some appropriately large c. 

Let us denote by S the set of subgraphs of K r other than the null graph, 
the edges, and the triangles. We will split this set into two further subsets, S' , 
the set of all subgraphs with maximum degree greater than or equal to 3, and 
S", the complement of this set in S. 

For graphs in S', Lemma 3.2 tells us that, for k and / greater than r cr , 



\ A/2+1 



< n r 



where A is the maximum degree of H, and where c\ depends on and grows 
with c. 

Now, every graph in S" either contains a path of length two or a cycle of 
length 4, in which case we have, from Lemma 3.3, and using our bounds on 
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H = max x | Y^ y g( x , y)\ and u, that 

M < 2.^^ + 2 (^V* 
<-^—n r , 

- r c 2 r k 

or is a product of single edges and triangles, in which case 
\ gH \ < 4r 2 ^n r + Sr/" 1 + 4 (?) " n r ~ 2 

where again C2 is just some constant that grows with c. 

Before we proceed with our estimate we also need to note firstly that the 
number of graphs with maximum degree A or less is at most r Ar and also that 
the maximum number of edges in such a graph is Ar (we may of course divide 
by a 2 here but this is not necessary for our estimates). 

We now have, using the fact that p is between 1/3 and 1/2, that 

E^ ,H Wi<(i) <a E4S« r + E^ 
ks w ks-' k iks- ' k 



2 I r CiAr fc r c 2 r£ 

7 A=3 

<^ n r 

2\ 2 ) r dr k 

for c chosen sufficiently large depending on d. 

So, getting back to our original intentions, we see that the number of 
r-tuples spanning a red K r is greater than or equal to 

fcG) _ fr\ k^)~ l _ fr\ fcG)" 3 _ 1 



(fc + Z)G) \ 2 /(ife + Z)G) ' \ 3 /(fc + i)G)- 2 2G)r*fc 

On the other hand we have that the number of r-tuples with a red K r 
across it is less than the number of (r — l)-tuples with a red K r _\ across it 
times r(k + 2 — r, I + 1). So we have that the number of K r s is at most 




in 7 - 1 



( r _ 1)^-1)^- y " \^(A; + /)(fe + /_ i)...(fc + /_ r + 2) ' ! : v[ ' ~ l/h ] "' 

Now we are going to subtract the lower bound from the upper bound, and 
divide through by n r to get an inequality which must hold if the graph on n 
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vertices contains neither a red Kk+i or a blue Ki + \. In so doing it is necessary 
to use the fact that 

k V' 2 ( (A; — 1) - - - (fc — r + 2) 



> 



> 



k + l) \(k + l-l)---(k + l-r + 2) / 

r - 1 \ lk r ~ 3 2 r 



2 J (k + l)(k + l-l)---(k + l-r + 2) (k + l) 2 
r - 1\ lk r ~ 3 T 



2 /(k + iy- 1 (k + l) 2 
so that all of the second order terms arising from the use of ^ ) 

instead of (jrq^j , other than that coming from the main term, will in fact be 

1/k 2 terms or smaller still. They can therefore be subsumed into the remainder 
term. This yields 

V - A Ik®' 1 , . fc©- 1 (r - 1\ k®~ 3 



(k + l)® +1 (k + l)® V 2 J (k + l)C, 



fcU> 7 l 



-(r-1) 7^ < 



(/c + Z)G) 2®r Dr k 

where -D is again just some constant which grows with c. 

Similarly, if we count blue K r s, though we have to be a little bit careful 
about degenerate terms and the fact that 1 — p may be slightly bigger than 
1/2, we get 

r-l\ fczG)" 1 I®' 1 A" 1 ^ l ®~ 3 

2 J (fc + Z)^ ~ (r_1) (fc + /)a) 5 ~ V 2 ) (k + lp- 2 * 

, I® 5 1 



(k + l)® 2®r Dr k 
Now we take the weighted sum of these inequalities in such a way as to 
make the triangle terms disappear, by adding k®~ 3 times the first inequality 
to jG)- times the second, to get 

1\ (kl)0~ 2 (k 2 -l 2 )(kl)®~ 3 . Ak 3 1 + l 3 b)(kl)®' 3 2kO- 
' +(r— 1) s— (r— 1) t^t < 



r — 
2 



(fc + Z)G) (ife + Z)G) (ife + Z)G) 2®r Dr ' 

Finally, provided that I > (l — k and c has been chosen large enough, we 
see that we can subsume the error term on the right hand side into the first 
term on the left hand side, the first term on the left hand side being then larger 
than 



r- 1\ k®~ 4 



DIAGONAL RAMSEY NUMBERS 



17 



(here we have used the fact that 1 — \ > e 2 / r for r > 2). 
Therefore, subsuming this term, we see that 

(r-l)(r-3) (fcZ)Q- 2 {k 2 - l 2 )(kl)(*)- 3 ^ (fc 3 7 + Z 3 ^)(fc/)(»)- 3 < Q 

2 (k + l)® (k + l)0 (k + l)Q 

Simplifying gives 

~ ^ kl + (fc 2 - l 2 )s - (fc 3 7 + l 3 5) < 0. 



2 

Recall now that 

s = ^ > -Zd, 

so therefore, since k > I, 

This contradicts the assumptions of the lemma, and so we are done. □ 



5. Using the Inequality 

All that now remains to be done is to find a function that satisfies the con- 
ditions of Lemma 4.1. The basic idea is to note that if we choose a continuously 
differentiable function a : [0, oo) — > [0, oo), then the function 

f(k, I) = exp{-a(l/k) \og(k + /)} 

satisfies the equation 

kj' + W = a(l/k), 

where by 7' and 5' we mean the derivatives of — log f(k,l) with respect to k 
and I. 

To use this fact we will choose a function a r which is everywhere less than 
or equal to the function n r , where 

f if 0<ar<l-i; 

K r (x) = I r -^x if 1 - \ < x < 1; 
^ K r (l/x) if X > 1. 

and which, moreover, is twice-differentiable. The specific function, if it chosen 
appropriately, will then be such that the true 7 and 5 differ by very little from 
7' and 5' for k and I chosen quite large, and this will allow us to conclude, for 
a suitably chosen a r , that 

r — 3 

kj + 16 < —^—K r (l/k). 

It is easy then to check that for some large multiple of the function a r the 
conditions of Lemma 4.1 are satisfied. 
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The first step in formalising this argument is to define an appropriate 
collection of functions a r , which we do as follows: 

Notation. Let r > 4 be a positive integer. We write (5 r : [0, 1] — ► [0, oo) 
for the polynomial function given by 

0r(z) = 6z 5 - 15z 4 + 10z 3 , 

and a r : [o, oo) — > [0, oo) for the function given by 



a r (x) 



' if0<x<l-i; 
r -^p r (2rx - (2r - 1)) if 1 - £ < x < 1; 
a r (l/x) if ar > 1. 



This slightly bizarre looking set of functions is chosen just so as to satisfy 
the following simple lemma: 

Lemma 5.1. For all r > 4, a r is a twice- differentiable function such that: 

1. for < x < 1, < a r (x) < r -^x; 

2. \a' r {x)\ < r 2 and \a"{x)\ < 20r 3 for all x. 

Before we start into the next lemma, we will again need some notation: 
Notation. Suppose that r > 4 is a fixed positive integer. We then write 
4> r (k, I) = a r (l/k) log(k + I). 

Our aim now is to show that f r = exp(— <f> r ) (or rather some large multiple 
of it) is an admissible function. The first step towards this is contained in the 
following lemma (note that this is essentially the same as Lemma 4 in [T88]): 

Lemma 5.2. For k and I greater than or equal to 200r 10 ; the inequalities 
exp{(j) r (k, I) — 4> r (k — m, I)} < 1 + mT 



and 



where 



exp{cj) r (k, I) — (j) r (k, I — m)} < 1 + mA, 



ct r {l/k)- - a r (l/k) + 



k + l rKI ' k 2 A{k + l) 

A = aAl/k)-^ + aUl/k)^±^ + 1 



k + l ryiJ k A{k + l) 

hold for m = 1, 2 and r — 1. 
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Proof. If we regard 4> r (k,l) as a function of k with I fixed, then we have, 
using Taylor's Theorem and the fact that 4> r is twice differentiable, that 

(f> r (k, I) - 4> r (k -m,l) = m^-(k, I) - ^-gj^r( k ~ dm > 

for some 6 between and 1. Now we have that 

d<f>r niu s 1 , l\og(k + l) 
— = a r (l/k) WJ -a r (l/k) k2 , 

and 

d 2 cb rn 1 , / , ilog(fc + Z) 



.(fc, |) = -a r (l/k)— — -2 - 2a' r {l/k) -^ JZ ■ A + 2a' r (l/k)- 
~ + 1) 

+a';(l/k)- 



Z 2 log(fc + /) 



fc 4 

Now note (by using part 2 of Lemma 5.1) that, for k and I both greater than 
or equal to 200r 10 , \^f{k,l)\ is less than or equal to 4r ^ + ^ ■ 
Therefore, in this case, we have that 

Mk,l) ~ Mk -m,l)<m L^l/k)^ - a' r {l/k) + + ^^y) • 

For brevity let's call the right hand side mi, noting that for k and I greater 
than or equal to 200r 10 , mx < 1. 

Therefore, using the fact that e z < 1 + z + z 2 for \z\ < 1, we see that 

exp{(f> r (k, I) — cf) r (k — m, I)} < 1 + mx + m 2 x 2 . 

Note then that, as for the second derivative, by taking k and I larger than 
200r 10 , we can make rx 2 smaller than 8 ^ +l ^ ■ Therefore, adding everything 
together, we see that 

2/ ,,/M 1 /„/M*log(fc + Z) , 1 



x + mx 2 < a r (l/k)- - a' r (l/k) — ^ + 



k + l ry ' ' k 2 4(fe + 0' 

which yields the required result. The result follows similarly for I. □ 

We are now ready to tie together everything we have learned in the pre- 
ceding sections to prove a theorem improving the general upper bound for 
Ramsey numbers. This theorem is as follows: 

Theorem 5.1. Let r > 4 be a fixed positive integer. Then there exists a 
constant c such that 



r(k + 1,1 + 1) < r cr ex.p{-<f> r (k, I)} 



k + l 
k 
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Proof. To begin let us suppose that f r is a function of the form f r (a, b) = 
Cexp{— 4> r (a, b)} for some fixed constant C, and let n = [fr(k,l)( k ^ 1 ) J = 
/*(fc,/)( fc ^), say. Suppose also that k t : [0, oo) — ► [0, oo) is the function given 
by 

j- if 0<ar<l-i; 

Kr(x) = I r ~^X ifl-i<X<l; 

[ K r (l/x) if X > 1. 

Then, by Lemma 5.1, since a r (x) < *^-x, we see that, for 1 > x > 1 — we 
have 

K r (x) > Q r (x) + -. 

If we now choose fc and I to both be greater than 200r 10 , we can apply Lemma 
5.2 to see that 

f r (k-ml) = exp | 0r(A;) ^ _ _ m ^ )} < x + mr> 

where 

T<aAl/k) T ^-aUl/k) l -^^ + 1 



fc + i rw ' k 2 4(k + l)' 

Furthermore, we have that 

f r (k — mA) ( \\ f r (k — m,l) 

< 1 + - , „ ' < 1 + m 7 , 



#(M) - V n/ /r(M) 
where 

/, n 1 / /, -> I log(/c + 1) 1 
7 < u r {l/k)- - a' r {l/k)—^- 9 '- + 



k + l rw ' k 2 2(k + l)' 

Similarly, we have that, for k and I both larger than 200r 10 , 
fr(k,l-m) <f 1 + lVr(M-m)^ 1 + 



#(M) - V nj f r (k,i) 

where 

5 < oc r {l/k)-\- + a;.(Vfc) l0g( ^ + /) + ' 



& + Z rw 7 fc 2(fc + Z)' 

Note therefore that 

kj + lS < a r (l/k) + - < K r (l/k), 

provided that 1 > min(|, j) > 1 — For min(^, j) < 1 — we have, provided 
k and I are large (again 200r 10 will easily suffice), that f r (a, b) is constant (equal 
to 1) close to (a, b) = (k, I) and so we again have 

kj + 15 < K r (l/k). 



DIAGONAL RAMSEY NUMBERS 



21 



Finally, choose k and I to be sufficiently large, greater than r cr , for some 
appropriate c, such that Lemma 4.1 holds in the following form: suppose that 
for m = 1, 2 and r — 1, each of the inequalities 

r k — m + P 



r(fc + 1 - m, Z + 1) < /(fc - m, Z) 



r(/c + 1, Z + 1 - m) < f(k, I - m) 



k — m 

k + 1 — m 
k 



f(k — m,l) , f(k,l — m) 

holds. Suppose also that |7| and \S\ are both smaller than r^j^- and that 
f(k,l) > exp{— rj: log A;}. Then, provided that 

&7 + 15 < n r (l/k), 

we have that 

r(k + l,l + l)<f(k,l)^ 1 

To conclude, suppose that N > max(200r 10 , r cr ) = r cr , for c chosen large 
enough, and consider the function f r (a,b) = (2N) r exp{— (p r (a, b)}. For either 
a or b less than or equal to N we have straightforwardly that for a > b with 

b < N, 



Ua,b) > - { - 2N }\, > 1 



using the fact that (a + b) h / a is a decreasing function in a. Now, both 7 and 
5 defined above are less than or equal to r^-f-^, and f(k,l) is certainly larger 
than exp{— log k}. Finally, we have by the construction of (p r and the choice 
of N that 

kj + 15 < K r (l/k). 

Consequently our induction holds good with this function f r . □ 

Theorem 1.2 is now a straightforward consequence of this theorem. The 
simple proof is in fact contained in the following proof of Theorem 1.1: 

Proof of Theorem 1.1. From Theorem 5.1, we know that, for integers 

r > 5, 

f2k s 

r(k + 1, k + 1) < r cr exp{-<p r {k, k)} " 



< r cr2 exp{-^— - log(2fe)} ' 



k 

k 



r cr2 (2k 



~ k dr \k 



22 



DAVID CONLON 



for some fixed constants c and d. 

If now, for any sufficiently large k, we take r = L 2e fogfogfc J ( a vaiUe which 

cr- 2 

is close to that which minimises j^zf), we see that for some constant C we have 

s-~i log k 

r(k + 1, k + 1) < k ^logiogfe 
as required. □ 
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